Cocktail Party Problem: Source Separation Issues and Computational Methods
نویسندگان
چکیده
The concept of the cocktail party problem (CPP) was coined by Cherry (1953). It was proposed to address the phenomenon associated with human auditory system that, in a cocktail party environment, humans have the ability to focus their listening attention on a single speaker when multiple conversations and background interferences and noise are presented simultaneously. Many researchers and scientists from a variety of research areas attempt to tackle this problem (Bregman, 1990; Arons, 1992; Yost, 1997; Feng et al., 2000; Bronkhorst, 2000). Despite of all these works, the CPP remains an open problem and demands further research effort. Figure 1 illustrates the cocktail party effect using a simplified scenario with two simultaneous conversations in the room environment. As the solution to the CPP offers many practical applications, engineers and scientists have spent their efforts in understanding the mechanism of the human auditory system, and hoping to design a machine which can work similarly to the human abStract
منابع مشابه
Improved Cocktail - Party Processing
The human auditory system is able to focus on one speech signal and ignore other speech signals in an auditory scene where several conversations are taking place. This ability of the human auditory system is referred to as the “cocktail-party effect”. This property of human hearing is partly made possible by binaural listening. Interaural time differences (ITDs) and interaural level differences...
متن کاملIca Based Blind Source Separation of Sound Using Matlab
Blind source separation is the separation of a set of signal from a set of mixed signals, without the aid of information (or with very little information) about the source signals or the mixing process. Source separation problems in digital signal processing are those in which several signals have been mixed together and the objective is to find out what the original signals were. The classical...
متن کاملMonaural Audio Speaker Separation Using Source-Contrastive Estimation
We propose an algorithm to separate simultaneously speaking persons from each other, the “cocktail party problem”, using a single microphone. Our approach involves a deep recurrent neural networks regression to a vector space that is descriptive of independent speakers. Such a vector space can embed empirically determined speaker characteristics and is optimized by distinguishing between speake...
متن کاملProbabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network
Separation of competing speech is a key challenge in signal processing and a feat routinely performed by the human auditory brain. A long standing benchmark of the spectrogram approach to source separation is known as the ideal binary mask. Here, we train a convolutional deep neural network, on a twospeaker cocktail party problem, to make probabilistic predictions about binary masks. Our result...
متن کاملCocktail Party Solutions: Mixing techniques
The human auditory system has an unparalleled ability to solve what is known as the cocktail problem: the task of perceptually separating superimposed acoustic sources in a noisy environment. Information about the auditory system can inform our choices of representation and computation. Recent progress has been made from several diier-ent approaches to the acoustic source separation problem. Co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016